Automated protein (re)sequencing with MS/MS and a homologous database yields almost full coverage and accuracy
Abstract
MOTIVATION: Bottom-up tandem mass spectrometry (MS/MS) is now routinely used in proteomics to identify proteins from a sequence database. De novo sequencing software is also available for sequencing novel peptides of relatively short length. However, automated sequencing of novel proteins from MS/MS data remains a challenging problem.
RESULTS: Very often, although the target protein is novel, a homologous protein is present in a known database. For this case, we propose a novel algorithm and automated software tool, named Champs, for sequencing the complete protein from MS/MS data of a few enzymatic digestions of the purified protein. Validation with two standard proteins showed that our automated method yields >99% sequence coverage and 100% sequence accuracy on these two proteins. Our method is useful for sequencing novel proteins or for 're-sequencing' a protein that has mutations compared with the database protein sequence.
Related articles
Shotgun protein sequencing with meta-contig assembly.
Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at l...
Tools for exploring the proteomosphere.
Homology-driven proteomics aims at exploring the proteomes of organisms with unsequenced genomes that, despite rapid genomic sequencing progress, still represent the overwhelming majority of species in the biosphere. Methodologies have been developed to enable automated LC-MS/MS identifications of unknown proteins, which rely on the sequence similarity between the fragmented peptides and refere...
Modification-tolerant Shotgun Protein Sequencing of a Snake Venom Proteome
Despite the steady accumulation of fully sequenced genomes for model organisms, limited or no sequence information is available for most organisms. Moreover, natural mechanisms of variation such as accelerated mutation and combinatorial recombination in immunoglobulins regularly create novel sequences in the proteomes of model organisms. However, since protein identification via database search...
Top-Down Analysis of Small Plasma Proteins Using an LTQ-Orbitrap. Potential for Mass Spectrometry-Based Clinical Assays for Transthyretin and Hemoglobin.
Transthyretin (TTR) amyloidosis and hemoglobinopathies are the archetypes of molecular diseases where point mutation characterization is diagnostically critical. We have developed a Top-down analytical platform for variant and/or modified protein sequencing and are examining the feasibility of using this platform for the analysis of hemoglobin/TTR patient samples and evaluating the potential cl...
CLONING AND SEQUENCING OF A MITOCHONDRIAL AUTOANTIGEN WITH IMMUNOGLOBULIN G FROM PATIENTS WITH MULTIPLE SCLEROSIS
Multiple Sclerosis (MS) is a chronic neurological disease of the central nervous system (CNS), characterised by a cellular immune response in early stages and demyelination of the CNS later. Although the cause of MS is unknown, there is much evidence that points to MS as an autoimmune disease. To test the hypothesis that an autoantigen is involved in MS, we screened a λgt11 human foetal spinal ...
Journal: Bioinformatics
Volume 25, Issue 17
Publication year: 2009